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Abstract 



We introduce a segmentation algorithm to probe temporal organization 
of heterogeneities in human heartbeat interval time series. We find that the 
lengths of segments with different local values of heart rates follow a power-law 
distribution. This scale-invariant structure is not a simple consequence of the 
long-range correlations present in the data. We also find that the differences in 
mean heart rates between consecutive segments display a common functional 
form, but with different parameters for healthy individuals and for patients 
with heart failure. This finding may provide information into the way heart 
rate variability is reduced in cardiac disease. 



A time series is stationary if the mean, standard deviation and all higher moments, as 
well as the correlation functions, are invariant under time translation |]J. Signals that do not 
obey these conditions are nonstationary. Nonstationarity is a prominent feature of biological 
variability that can be associated with regimes (segments) of different statistical properties. 
The borders between different segments can be gradual or abrupt (Fig. [I|). 

A major problem in contemporary physiology is the presence of nonstationarity in time 
series generated under free-running conditions f2|. Physiological signals obtained under 
widely-varying conditions raise serious challenges to both technical and fundamental as- 
pects of time series analysis. By filtering out effects of nonstationarity, much work has 
focused on "intrinsic properties" of physiological signals 0. This approach is based on the 
implicit assumption that the nonstationarity arises simply from changes in environmental 
conditions — e.g., different daily activities — so environmental "noise" could be treated 
as a "trend" and distinguished from the more subtle fluctuations that may reveal intrinsic 
correlation properties of the dynamics. Indeed, important scale-invariant features in physio- 
logical processes were recently revealed after filtering out masking effects of nonstationarity 
||. However, nonstationarity itself is also an important feature of physiological time series 
and is known to change from healthy to pathological conditions ||, suggesting more than 
only enviromental conditions are reflected in the phenomena. Thus one would expect that 
there is a non-trivial structure associated with the nonstationarity in physiological signals, 
which may change with disease. To test this hypothesis we focus on one statistical property, 
the mean heart rate, which is related to physiologic responses and is commonly used for 
medical evaluation. 

The problem is to partition a nonstationary time series, which is composed of many 
segments with different mean value, in such a way as to maximize the difference in the mean 
values between adjacent segments. We apply the following procedure: we move a sliding 
pointer from left to right along the signal. At each position of the pointer, we compute the 
mean of the subset of the signal to the left of the pointer (/xi e f t ) and to the right (/i r i g ht)- To 
measure the difference between /ii c f t and fright, we compute the t-statistic ||: 
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where sd is the pooled variance [0. 

We next determine the position of the pointer for which t reaches its maximum value, 
t max , and compute the statistical significance of £ max ||. We check if this significance exceeds 
a given threshold Vq. If so, then the signal is cut at this point into two subsequences; 
otherwise the signal remains undivided. If the sequence is cut, the procedure continues 
recursively for each of the two resulting subsequences created by each cut. Before a new cut 
is accepted, we also compute t between the right-hand new segment and its right neighbor 
(obtained by a previous cut) and the t between the left-hand new segment and its left 
neighbor (also obtained by a previous cut) and check if both values of t have a statistical 
significance exceeding Vq. If so, we proceed with the new cut; otherwise we do not cut. This 
ensures that all resulting segments have a statistically significant difference in their means. 
The process stops when none of the possible cutting points has a significance exceeding Vq, 
and we say that the signal has been segmented at the "significance level V§ (Fig. 0). 

Our method leads to partitioning of a time series into segments with well-defined means, 
each significantly different from the mean of the adjacent segments (Fig. |XJ) . This allows us 
to probe the nonstationarity in a signal through the statistical analysis of the properties of 
the segments. 

Here we consider 47 datasets from 18 healthy subjects, 17 records of cosmonauts during 
orbital flight and 12 patients with congestive heart failure . We separately analyze 6-hour 
long subsets of each dataset, corresponding to the periods when the subject is awake or 
sleeping. Figure [l| shows a representative dataset of a healthy subject, and a subject with 
heart failure. Superposed on the interbeat interval series, we also plot the segments obtained 
by means of our segmentation algorithm. 

To quantify the nonstationarity in heart rate variability, we study the statistical prop- 
erties of the segments corresponding to parts of the signal with significantly different mean 
values. To characterize the segments, we analyze two quantities: (i) the length of the seg- 
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ments; (ii) the absolute values of the differences between the mean values of consecutive 
segments, which we call jumps. 

(i) Distribution of segment lengths — Healthy subjects typically exhibit nonstationary be- 
havior associated with large variability, trends, and segments with large differences in their 
mean values, while data from heart failure subjects are characterized by reduced variability 
and appear to be more homogeneous (Fig. |l|) ||. Thus, one might naively expect that 
signals from healthy subjects will be characterized by a large number of segments, while sig- 
nals from heart failure subjects will exhibit a smaller number of segments (i.e., the average 
length of the segments for healthy subjects could be expected to be smaller than for heart 
failure subjects). 

To test this hypothesis, we apply the segmentation algorithm to 6-hour records of inter- 
beat intervals during daily activity, and find that for each healthy subject the distribution 
of segment lengths is well described by a power law with an identical exponent, indicating 
absence of a characteristic length for the segments. Surprisingly, we find that this power 
law remains unchanged for records obtained from cosmonauts during orbital flight (under 
conditions of microgravity) and for patients with heart failure (Fig. ^). A similar common 
type of behavior is also observed from 6-hour records during sleep for all three groups [JTU 



To verify the results of the segmentation procedure, we perform several tests. First, we 
check the validity of the observed power law in the distribution of segment lengths. We gen- 
erate a surrogate signal formed by joining segments of white noise with standard deviation 
a = 0.5, and mean values chosen randomly from the interval [0, 1]. We choose the lengths of 
these segments from a power-law distribution with a given exponent. Even when the differ- 
ence between the mean values of adjacent segments is smaller than the standard deviation 
of the noise inside the segments, we find that our procedure partitions the surrogate signal 
into segments with lengths that reproduce the original power-law distribution [Fig. |](a)]. 
This test shows that the distributions obtained after segmenting surrogate data with similar 
values of their exponents, appear clearly different from each other, making more plausible 
that the distributions obtained for the lengths of the segments for the healthy, cosmonauts 



and congestive heart failure subjects (Fig.|3|) follow indeed an identical distribution. 

Second, we test if the observed power-law distribution for the segment lengths is simply 
due to the known presence of long-range correlations in the heartbeat interval series ||TT] . 



For that, we generate correlated linear noise [12] with the same correlation exponent as the 



heartbeat data. We find that the distribution of segment lengths obtained for the linear 
noise differs from the distribution obtained for the heartbeat data [Fig. §](b)]. For the noise, 
the distribution decays faster, which means that these signals are more segmented than the 
heart data. In fact, for different linear noises with a broad range of correlation exponents, we 
do not find power-law behavior in the distribution of the segments. Thus we conclude that 
the linear correlations are not sufficient to explain the power-law distribution of segment 
lengths in the heartbeat data. 

(ii) Differences between the mean values of consecutive segments (jumps) — Different healthy 
records can be characterized by different overall variance, depending on the activity and the 
individual characteristics of the subjects. Moreover, subjects with heart failure exhibit 
interbeat intervals with lower mean and reduced beat-to-beat variability (lower standard 
deviation). Thus one can trivially assume that these larger jumps in healthy records are due 
only to the fact that their average standard deviation is larger [Fig. [TJ(a)(b)]. In order to 
systematically compare the statistical properties of the jumps between different individuals 
and different groups, we normalize each time series by subtracting the global average (over 6 
hours) and dividing by the global standard deviation. In this way, all individual time series 
have zero mean and unit standard deviation [Fig. 0(c) (d)]. Such a normalization does not 
affect the results of our segmentation procedure. 

We find that both the healthy subjects and the cosmonauts follow identical distribu- 
tions, but the distribution of the jumps obtained from the heart failure group are markedly 
different — centered around lower values — indicating that, even after normalization, there 
is a higher probability for smaller jumps compared to the healthy subjects [Fig. |^(a)]. Note 
that the distributions for all groups appear to follow an identical homogeneous functional 
form, so we can collapse these distributions on top of each other by means of a homogeneous 



transformation [Fig. |5|(b)]. The ratio between the scaling parameters used in this transfor- 
mation gives us a factor by which this feature of the heartrate variability is reduced for the 
subjects with heart failure as compared to the healthy subjects. This finding indicates that, 
although the heartrate variability is reduced with disease, there may be a common structure 
to this variability, reflected in the identical functional form. These observations agree with 
previously reported results for the distribution of heartbeat fluctuations obtained by means 



of wavelet and Hilbert transforms [13 



In summary, we present a new method to probe the nonstationarity of a signal by par- 
titioning it into segments with different mean values. We find a scale-invariant structure 
in the nonstationarity of a time series representative of a complex dynamics, namely the 
human heartbeat. This structure is characterized by a power-law distribution of the lengths 
of segments with a scaling exponent which does not change under certain pathological con- 
ditions and cannot be explained by the presence of correlations in the data. We find also a 
common structure to the jumps between consecutive segments, with a change in the scaling 
parameters with disease. 
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Peng, and Z. Struzik for helpful discussions and suggestions, grants BIO99-0651-CO2-01 
(from the Spanish Government) and NIH/NCRR (P41RR13622), NASA, and the Mathers 
Charitable Foundation for support. 
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FIG. 1. (a) Plot of 20,000 interbeat intervals (pa 6 hours) for a healthy subject (upper curve) 
and a subject with heart failure (bottom curve). Note the larger variability and patchiness for the 
healthy record, (b) Magnification of a small fraction (2000 beats) of the signals in (a), (c) Same 
signals as displayed in (a) after subtracting the global average and dividing by the global standard 
deviation; after this normalization both signals appear very similar, (d) Magnification of a small 
fraction (2000 beats) of the signals in (c). 



9 




FIG. 2. (a) An artificial time series f(x) composed of three segments with different mean values, 
(b) Values of the statistic t, defined in Eq. (p]), obtained by moving the pointer along the time 
series. Note that i max is reached at x\. We find that if V{t max ) > Vo = 95%, and so we cut the 
series at x\. (c) We iterate the procedure with the segment [0, x{\. We find that V(t max ) > 95% 
and we also find that the significance of t computed between [x%, x\] and [x2, 2000] is greater than 
95%, so the series is cut at x^. (d) We iterate the procedure with the segment [xi,2000]. Now, 
Vitxasx) < 95%, so this segment is not cut. 
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FIG. 3. Probability of finding segments with a length I larger than a given value for the 
segments obtained from all subjects in the healthy, cosmonauts and heart failure groups during 
during daily activity. The significance level is fixed to Vq = 95%, and the imposed minimum length 
of the segments is £q = 50 beats. For all three groups we find a power law in the distribution of 
segment lengths with exponent [5 f» 2.2, and we find that depends on £q and on Vq. For all Iq 
and Vq, the value of (3 is the same for each three groups. 
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FIG. 4. (a) Testing the validity of the observed power-law behavior in the distribution of 
segment lengths. We generate a surrogate signal formed by joining segments of white noise with 
standard deviation a = 0.5 and average values chosen randomly from the interval [0, 1]. We chose 
the lengths of these segments from a power-law distribution with a given exponent j3. The test 
shows that the distributions obtained after segmenting the surrogate data generated from power-law 
distributions with nearby values of their exponents appear clearly separated. This suggests that the 
distributions for the healthy, cosmonauts and congestive heart failure subjects in Fig. || are indeed 
identical, (b) Testing the effect of correlations in the heartbeat fluctuations on the segmentation. 
We generate 10 realizations, each with length of 26,000 points, of a linear Gaussian-distributed 



correlated noise with an exponent a = 1.1 [12]. This exponent is calculated using the detrended 



fluctuation analysis method and is identical to the exponent a observed for the heartbeat data 
[11]. The distribution of segment lengths for this correlated noise does not follow the power law 
found for the heartbeat data. This test suggests that the observed scale-invariant behavior in the 
distributions of segment lengths in the heartbeat is not simply due to the correlations. According to 
the results in (a), the differences found between heartbeat data and correlated noise are significant. 
To verify that the curvature found in the distribution of segments for the noise is not due to finite 
size effects, we also repeated the test with longer realizations of the noise (1,000,000). 
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FIG. 5. (a) Probability distribution of the absolute value of the difference between the mean 
values ('jumps') of consecutive segments. Both healthy and cosmonaut subjects follow an iden- 
tical distribution while the heart-failure subjects follow a quite different distribution with higher 
probability for small jumps consistent with reports of smaller variability in heart failure subjects 
[]|]. All distributions are normalized to unit area, (b) Same probability distributions as in (a), 
after rescaling P(s) by -P m ax> and s by 1/P max . This homogeneous transformation preserves the 
normalization to unit area. The data points collapse onto a single curve. 
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